
# TRL Fine-tuning

Qwen3 8B Grpo Medmcqa
A fine-tuned version of Qwen/Qwen3-8B trained on the medmcqa-grpo dataset, specialized for medical multiple-choice question answering.
Large Language Model, Transformers
mlxha
84
1
Deepseek R1 Chinese Law
Apache-2.0
A Llama model trained with Unsloth and the Hugging Face TRL library, with a claimed 2x inference speedup.
Large Language Model, Transformers, English
corn6
74
2
Travelbot
Apache-2.0
A Llama model trained with Unsloth and the Hugging Face TRL library, with a claimed 2x inference speedup.
Large Language Model, Transformers, English
kitty528
9,146
2
© 2025 AIbase